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Abstract 

Evolution on neutral networks of genotypes 
has been found in models to concentrate on 
genotypes with high mutational robustness, to 
a degree determined by the topology of the 
network. Here analysis is generalized beyond 
neutral networks to arbitrary selection and 
parent-offspring transmission. In this larger 
realm, geometric features determine mutational 
robustness: the alignment of fitness with the 
orthogonalized eigenvectors of the mutation 
matrix weighted by their eigenvalues. “House 
of cards” mutation is found to preclude the evo¬ 
lution of mutational robustness. Genetic load 
is shown to increase with increasing mutation 
in arbitrary single and multiple locus fitness 
landscapes. The rate of decrease in population 
fitness can never grow as mutation rates get 
higher, showing that “error catastrophes” for 
genotype frequencies never cause precipitous 
losses of population fitness. The “inclusive 
inheritance” approach taken here naturally ex¬ 
tends these results to a new concept of dispersal 
robustness. 

Keywords: genetic load — spectral gap — lethal 
mutagenesis — epigenetic mutation — dispersal load 

Based on theoretical considerations, Kimura [T] pre¬ 
dicted that the majority of evolutionary changes in the 
genome in mammals should consist of neutral muta¬ 
tions. Since this time, research has been focused on 
understanding the extent and nature of neutral genetic 
variation in organisms. One approach is to attempt 
to derive them from molecular first principles [H E]. 
The idea that neutrality may not merely be a derived 
consequence of molecular interactions, but actually an 
evolved property shaped by evolutionary dynamics, had 
its first manifestation in Fisher’s theory for the evolu¬ 
tion of dominance [J] , followed by Waddington who 
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proposed it as a consequence of stabilizing selection, 
and Conrad (cf. [3 [7]) who proposed it could result 
from higher-order epistatic mutations which smooth the 
adaptive landscape at other sites and consequently en¬ 
hance evolvability. Another mechanism proposed is that 
natural selection for adaptations robust to environmen¬ 
tal variation entails robustness to mutation as a generic 
side-effect [S]. 

An altogether different mechanism for the evolution 
of mutational robustness was proposed by Bornberg- 
Bauer and Chan [5] and van Nimwegen et al. mm, 
which is that evolution along neutral networks —sets 
of mutationally connected genotypes with equivalent 
fitnesses—would “concentrate at highly connected parts 
of the network, resulting in phenotypes that are rel¬ 
atively robust against mutations” m, and “Indepen¬ 
dent of functional fitness, topology per se can lead 
to concentration of evolutionary population at some 
sequences.” in]. 

Numerous citations of these papers repeat the finding 
that “populations will evolve toward highly connected 
regions of the genome space” (e.g. m, US])- However, 
neither the original papers nor subsequent studies (to 
my knowledge) provide analytic proofs of this observa¬ 
tion. With few exceptions [14] progress has been limited 
in identifying exactly which properties of neutral net¬ 
works determine their evolved mutational robustness. 
As is shown by the following example, more is going on 
than simply a “tendency to evolve toward highly con¬ 
nected parts of the network”. 

Figure [l] compares the equilibrium population distri¬ 
bution for two neutral networks that have 63 equally 
fit genotypes, and one inviable genotype off the net¬ 
work. The only difference between the networks is their 
mutational topology. One network is the set of all 64 
nucleotide triplets, while the other is the set of copy- 
number variants, from 1 to 64 copies. Two properties of 
the population equilibrium distributions on the neutral 
networks stand out: first, the equilibrium frequency of a 
genotype is determined not solely by how many neutral 
neighbors it has, but also by its position within the net¬ 
work. In the copy-number network, 62 of the genotypes 
are identical in having only neutral neighbors. Yet the 
stationary distribution increases 20-fold over these 62 


1 



Fundamental Properties of the Evolution of Mutational Robustness— Lee Altenberg 


2 


EQUILIBRIUM DISTRIBUTION 

STEPWISE COPY-NUMBER MUTATION NETWORK 



Figure 1: Equilibrium distributions and genetic loads 
for two neutral networks having 63 neutral genotypes 
and one lethal genotype (circled), under two mutation 
topologies: Mutation of a nucleotide triplet, and step¬ 
wise copy-number mutation. 

genotypes as they get more mutationally distant from 
the lethal genotype US]. The same phenomenon is seen 
in the trinucleotide network but to a lesser degree. Sec¬ 
ondly, dramatic differences are seen between the two 
neutral networks in the size of the genetic loads they 
maintain. The genetic load of the trinucleotide network 
is 44 times the genetic load of the copy-number network. 

The differences between the evolutionary outcomes 
on these two mutational graphs can only be the result 
of their different topologies, but what properties of their 
topologies? Numerous results from the field of spectral 
graph theory are applicable to this question; here no re¬ 
view is attempted. While graph theory has been widely 
used in models of mutation, the results typically impose 
the assumptions that mutations occur only once during 
replication, mutation is symmetric and occurs at a sin¬ 
gle rate, and fitness is either zero or a single other value 
(all assumptions in [fTl ITT]'). 

An alternative approach is taken here, which has 
proven valuable in previous work [m na [m ng ED]. 
The specific problem—evolution on neutral networks in 
nucleotide sequence space—is embedded into the larger 
space of problems, which includes arbitrary mutation 
patterns and arbitrary selection values. The generality 
forces one to seek the appropriate fundamental mathe¬ 
matical properties. 

It also expands the applicability of any results. Muta¬ 
tion is treated generally enough to apply to non-genetic 
information transmission, the principal example being 
organismal location, where the analog of mutation is 
dispersal. The results here thus automatically apply to 
dispersal load, and in the process define a new concept 
of dispersal robustness. Moreover, the generality of the 


treatment has the potential to apply widely to diverse 
mechanisms of “inclusive inheritance” pT] . 

With arbitrary fitnesses, one can no longer character¬ 
ize mutation neighborhoods simply by the fraction of 
mutations that are neutral, since now the distribution 
of fitness effects of mutation (DFE) includes advanta¬ 
geous or deleterious mutations, as well as neutral and 
lethal. The more general statistic is the expected fitness 
of offspring. Averaged over the population, one obtains 
an aggregate population mutational robustness — the 
degree to which the population maintains its fitness in 
the face of mutation pressure. A complete absence of 
mutational robustness occurs when the genetic load is 
the mutation rate, corresponding to the Haldane-Muller 
principle pgES]- Complete mutational robustness, on 
the other hand, would mean that the population suffers 
no genetic load as the mutation rate increases. A given 
adaptive landscape will fall somewhere between these 
two extrema. 

The main results found here are that the population 
mutational robustness at a mutation-selection balance 
is determined by abstract spectral properties: the align¬ 
ment between the fitnesses and the eigenvectors of the 
mutation matrix, weighted by their eigenvalues. 

This spectral analysis provide a lens through which 
to examine models of mutation, dispersal, an inheri¬ 
tance generally. Models of mutation and selection are 
usually constructed without appreciation of how the as¬ 
sumptions manifest in the eigenvalues and eigenvectors 
of the mutation matrix (or variation operator in the 
case of continuous variation). We will see, for exam¬ 
ple, that the widely encountered E3] “house-of-cards” 
mutation model |25j is incapable of supporting muta¬ 
tional robustness. The results here provide direction 
for analyzing a wide variety of models for their capacity 
to support the evolution of mutational robustness, and 
enable comparisons to be made between different kinds 
of mutation processes, including nucleotide base muta¬ 
tion, epigenetic mark mutation, and gene copy number 
variation. The comparisons extend to the spectral prop¬ 
erties of dispersal matrices, thereby unifying the results 
for genetic robustness and genetic load with those for 
“dispersal robustness” and dispersal load. 

1 The Setting 

The object of interest is the state of the population 
when it has converged in frequencies to an equilibrium 
under the following asexual, haploid evolutionary dy¬ 
namics, where the population is large enough to be 
treated as infinite and has discrete non-overlapping gen¬ 
erations (semelparity). The only event that changes the 
genotypes during reproduction is mutation; there is no 
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recombination. 


1 " 

'iit + 1 ) = WjZjit), ( 1 ) 


From Q, equilibrium frequencies must satisfy wZi = 
12^=1 or in vector form, 


M(/i) D z = (w^z) z = w z, 


( 4 ) 


where 


where 


n is the number of possible haploid genotypes, 

Zi (t) is the frequency of haploid genotype i in the pop¬ 
ulation at time t, Zj(t) > 0, •^*(^) = 

Wj is the fitness of genotype j, Wi > 0, 

is the probability, when the mutation rate is 
/r G [0,1], that parent of genotype j has offspring 
of genotype z, > 0, = 1 and 

w{t) = mean fitness of the popu¬ 

lation at time t, its Malthusian rate of increase in 
size. 


In the case of a multilocus genotype where muta¬ 
tion occurs independently at each of L loci, the mu¬ 
tation probabilities may be decomposed as products 
of the mutation probabilities at each locus, = 

where index the alleles at 

locus 

Mutation at each locus ^ may be parameterized by 
a mutation rate u and a mutation distribution, p/l- , 
given that mutation occurs: = (1 — + 

where da = 1 and 5ij = 0 when i ^ j. In 
matrix form this is 


= (1 --b(2) 

where each is an x column-stochastic matrix, 
is the number of possible alleles at locus and 
is the x identity matrix. 

The multilocus mutation matrix can be represented 
using the Kronecker product, 0, as 


+fiP 


?=i 


(«) 


( 3 ) 


Matrices of the form (1 —p,)I-|-p,P represent the situa¬ 
tion where only a single transforming event occurs dur¬ 
ing reproduction and will be referred to as single-event 
mutation matrices, while those of the form (|^ represent 
the situation where multiple independent transforming 
events occur during reproduction and will be referred 
to as multiple-event mutation matrices. In the limit 
of small p,, multiple-event matrices converge to single¬ 
event matrices, even when they model multiple loci. 


e is the vector of ones, = [l 1 • • • l] its trans¬ 

pose, 

D := diag[wi] is the diagonal matrix of fitness coeffi¬ 
cients, 

w = De is the vector of fitness coefficients, and 

w := Yl=i Dz = w^z is the mean fitness of 

the population. 

Equation Q shows that the mean fitness uJ = Dz is 
an eigenvalue of M(/j,)D. 

It is further assumed that M(/i) is irreducible when 
> 0, meaning any genotype j can mutate in some 
number of steps to any other genotype i. M(/i)D is irre¬ 
ducible as well if no genotype is lethal, so from Perron- 
Frobenius theory we know there is only one possible 
equilibrium, the strictly positive Perron vector z, and 
eigenvalue w is the Perron root and spectral radius of 
M(/r)D, referred to as r(M(/r)D) = w. 

If some mutations are lethal, then M(/i)D is re¬ 
ducible, and more interesting equilibrium behavior be¬ 
comes possible. The Frobenius normal form of a re¬ 
ducible matrix ((3^ pp. 74-77]) partitions the set 
of genotypes into blocks, such that the restriction 
of M(/r)D to any block n produces an irreducible 
matrix The blocks constitute quasis¬ 

pecies [37]. The block with the largest spectral radius 
r(M(^)MDM) among the genotypes present at equi¬ 
librium sets the mean fitness of the population. The 
analysis will therefore focus on such irreducible block 
matrices. 

1.1 Diploidy, frequency-dependent se¬ 
lection, recombination, assortative 
mating, and dispersal 

The mutational robustness of population at equilibrium 
can be analyzed without knowing whether this equi¬ 
librium is dynamically stable. In the case of haploid 
selection on a quasispecies without recombination, the 
system is essentially linear and the polymorphic equilib¬ 
rium is always stable. Relaxation of these assumptions 
to accommodate greater biological variety may alter the 
stability and existence conditions of equilibria but does 
not change the robustness analysis. The results here 
therefore apply to equilibria in systems with diploid and 
frequency-dependent selection, assortative mating, and 





Fundamental Properties of the Evolution of Mutational Robustness— Lee Altenberg 


4 


also recombination. In this case, letting T^-fc be the 
probability that parents j and k produce offspring i, 
one gets a a frequency-dependent version of Q: 

^ n n 

TijkfJJjk^jk ( 2 ): ( 5 ) 

^ ' j^l k^l 

1 "" 

= u)j{z) Zj{t), (6) 

' ' i=i 

where Wjk is the fitness of diploid genotype 
jk, Ajfc(z(t)) is the probability that the mate 
of genotype j is genotype k (frequency depen¬ 
dent), Wj{z) = YJk=i'^jkXjk{z), and Mij{z) = 
J2k=i'^ijkWjkXjk{z)/wj{z). Under random mating, 
Xjk{z{t)) = Zk{t). Selfing cannot be accommodated 
within the product structure in ([^ . In models of disper¬ 
sal, z(t) typically represents the density of organisms, 
without normalization by w. Carrying capacities then 
makes the growth rates Wi(z) density dependent. 

Because frequency dependence is relevant only to sta¬ 
bility, which is not considered, the analysis to follow 
applies to both 0 and 0. However, frequency depen¬ 
dence of both Mij(z} and Wj{z) makes them composite 
formal quantities, rather than biological essential quan¬ 
tities as in the case of mutation and haploid selection. 


Definition 1. At a mutation-selection balance, the 

population mutational robustness is defined to be 


Ro(M,D) 


w r(MD) 

maxi [rcj] max^ [ui^] 


( 8 ) 


The quantity r(MD)/maxi[r(;i] = Ro(M,D) = 1 — L 
is the complement of the classical genetic load, L [52J 
US]. We would like to know where Ro(M, D) falls within 
the possible range of values given the mutation rate in 
M. The maximum possible value is Ro(M,D) = 1, 
meaning no loss of fitness from mutation. The minimum 
possible value will be called the Haldane limit, which is 
(1 — for L loci. 

Result 2 (Haldane Limits on Mutational Robustness). 


Ro(M(/r), D) := > (1 _ 

i 

See also [28l eq. (2.3)], [Ml PP- 149-150, eq. (5.31)- 
(5.34)]. The proof for this result and those following are 
provided in the Supporting Information (5/). 

Robustness relative to the Haldane limit can be de¬ 
fined. 


1.2 Mean Fitness, Genetic Load, and 
Mutational Robustness 


Let us retrace the calculation of population mutational 
robustness in the neutral network model of van Nimwe- 
gen et al. m- Genotypes have only two possible fit¬ 
nesses, w, and 0. The mutational robustness of a geno¬ 
type with fitness w is the probability that its offspring 
have fitness w. The set of genotypes in the neutral 
quasispecies is Af. Then the mutational robustness of 
a genotype—it’s neutrality — is Vj := 
population neutrality at equilibrium z is the expecta¬ 
tion of Vj censused before reproduction, with parent 
frequencies wzijw for i G Af. Using Q, since wj = 0 
or Zj = 0 for j ^ Jf, 

n ^ ^ n 

E WjZj ^ WZj X—' X—^ 

w=y w^Zi = y wzi. 

■ 1 ^ ^ 

j = l jGTv 1=1 iGTv 


From these we see that the equilibrium average neutral¬ 
ity is 


V = 


E 



W 


E 


W 


E^*= 


w 

w 


r(MD) 

w 

(7) 


The concept of mutational robustness can be ex¬ 
tended beyond neutral networks by simply generalizing 
w to maxfwi]. 


Definition 3. Define relative mutational robust¬ 
ness as 


RRo(M(/r),D) 


r(M(/i)D) 

max[wi] 

i 




1 - (1 - 


e [0,1]. 


One further quantity of interest is the average fitness 
of mutant offspring, w'J"^ := Mij. 

Theorem 4. The average mutant offspring fitness in a 
population at mutation-selection balance is 


w 


-(M(m),D) :=^ 




Wi 


i=i 


' W 


= w 


1 - 


((->■)" Var[S] 


assuming the mutation distributions have pj^P=0 for all 
loci ^ and alleles i. 


This identity precisely expresses Fisher’s “deteriora¬ 
tion of the environment” argument [501 p. 42], that at 
any equilibrium, the average mutant fitness loss must 
exactly offset the gain in fitness from the fitness vari¬ 
ance. 
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2 Basic Results 


The core finding of m is that when genotype neutrali¬ 
ties vary over the neutral network, “the population neu¬ 
trality is typically larger than the network neutrality. 
... Thus, a population will evolve a mutational robust¬ 
ness that is larger than if the population were to spread 
uniformly over the neutral network.” While numerical 
examples are explored, no analytical results are pro¬ 
vided that define precisely when it holds. It is claimed 
in [m eq. (12)] that when the mutation matrix is sym¬ 
metric, i.e. M = IVR, this result can be obtained from 
Perron-Frobenius theory, but no proof is provided. A 
proof is here provided (in SI), but it requires more than 
Perron-Frobenius theory, specifically Rayleigh theory. 

Theorem 5 (Neutral Network Robustness). The equi¬ 
librium population robustness V = r(MD)/w is always 
greater than the average robustness over the neutral net¬ 
work, 


E[u] 




pL < 1/2, when the quasispecies has variation in fitness 
within its irreducible blocks. 

Theorems]^ andhave important implications for the 
theory of lethal mutagenesis. They show that the pop¬ 
ulation mean fitness will keep decreasing with greater 
mutation over at least the range 0 < p < 1/2. 

Corollary 8 (Sufficiency for Lethal Mutagenesis). If 
r(M(l/2)D) < 1 then for some p* < 1/2, the popula¬ 
tion will go extinct for all p > p*. 

The observation in m that “Perhaps surpris¬ 
ingly, the tendency to evolve toward highly connected 
parts of the network is independent of evolutionary 
parameters—such as mutation rate,” is solely due to 
there being only two fitnesses in the model, 0, and w. 

Result 9 (Single-Event Relative Robustness Increases 
with Mutation Rates). As the mutation rate increases 
with single-event mutation, if a quasispecies has more 
than one [only one] nonlethal fitness value, then the rel¬ 
ative mutational robustness RRo(M(/r),D) strictly in¬ 
creases [is constant] with mutation rate. 


if (1) the mutation matrix M is symmetric, (2) fitness 
off the network is 0, and (3) not all genotypes on the 
neutral network have the same mutational robustness. 


For multiple-event mutation, the relative mutational 
robustness may increase or decrease in as found in 
initial numerical exploration. 


When mutation is not symmetric, counterexamples 
can easily be constructed where the population evolves 
to an average robustness that is less than the average 
robustness over the entire neutral network. One ex¬ 
ample is where mutation is cyclic. Another example 
is where asymmetric mutation is symmetrizable (a re¬ 
versible Markov chain), but mutation is biased toward 
genotypes with few neutral neighbors (in SI). 

The next result adopts to quasispecies a theorem of 
Karlin, which has been available for over 30 years, and 
was independently proven in El- 

Theorem 6 (5.2 of Karlin [32]). The equilibrium ge¬ 
netic load within any quasispecies under single-event 
mutation strictly increases with mutation rate when 
there is any variation in fitness, i. e. for each irreducible 
block of loci K, r([(l — /j,)l[''l -|- /iPl'^ljDl'^l) strictly de¬ 
creases with p when ^ c I for any c > 0, or when 
r(pM) < 1. 

The same outcome has been shown for multiple-event 
mutation when mutation at each locus is reversible, by 
which I refer to M being the transition matrix of a 
reversible Markov chain, which is also equivalent to M 
being symmetrizable. 

Theorem 7 (Corollaries 1, 3 of (SSj). The equilibrium 
genetic load of a quasispecies under irreducible multiple- 
event mutation strictly increases with mutation rate 0 < 


3 Results for Reversible Markov 
Chain Mutation 

Theoremj^was provable due to the tractability afforded 
by reversible Markov chains. This same tractability car¬ 
ries over to the analysis of mutational robustness. For 
a reversible chain, M can be represented (|34l p. 33], 
[35], lag) as: 

M = Dj/^KAK^ 


where reversibility requires MijiTj = MjiTTj, and 

TT = (tti, ..., TTnY = Mtt is the stationary distribution 
for M, i.e. its Perron vector, 

1 /2 

T>t[ is the diagonal matrix of the square roots of tt^, 


K is the matrix of orthogonalized eigenvectors of M, 
KRT = K^K = I, i.e. for each j = l,...,n, 
Efci = 1, Er=i = Qiij^h, 

[K]j is the /-th column of K, 


or [K]i = D^'e, 


A := diag 


A,, 


is the diagonal matrix of eigenvalues 


Ai > A 2 P ‘ ‘ ‘ P uf IVI. 
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Applying this representation of M, we find that the 
spectral radius of MD is that of a symmetric matrix: 

r(MD) = r(Dy2KAK^ 

= r(KAKTD-i/2DDy2) 

= r(KAKTD) = r(AKTDK) 

= r(Di/2KAK^ 

This allows use of the Rayleigh-Ritz variational formula 
for the spectral radius ([SSI pp. 172-173], [37l pp. 176- 
180]), to give the first main result. 

Theorem 10 (Expression for the Spectral Radius). 
When mutation has the transition matrix of a reversible 
Markov chain, M = D^/^KAK^DTr^^^, then 

r(MD) = max D^/^KAK^ D^/^x 

X=1 

71/71 \ 2 

= max VaJ ) . (9) 

xv x=i \ ' j 

J — i- ^ I—I ' 

Here we see a fundamental property of the population 
mutational robustness: 

Corollary 11. Holding K and w, fixed, r(MD) is non¬ 
decreasing in each eigenvalue of the mutation matrix, 
A j, i — 2,..., n. 


3.1 Bounds Derived from the Muta¬ 
tional Eigenvectors 

Weinberger [38] introduced the idea of representing fit¬ 
ness landscapes using the eigenvectors of the matrix 
representing the mutation structure, and this “Fourier 
expansion” approach has been elaborated by numerous 
further studies (cf. [311 HD] ). The second main result is 
now presented. 

Theorem 12. (Lower Bound from the Fourier 
Representation of Fitness) Represent the fitnesses 
Wi as Wi — or w = Ka, where Oi is the 

Fourier coefficient for eigenvector [Kju Then 


r(MD) > 


1 


E TI 

.7 = 1 ^0 


i=i 


The Fourier coefficients Oj measure the ‘alignment’ 
between w and each column [K],, and a| is its “ampli¬ 
tude” [41]. From the theory of discrete nodal domains 
[13] . we know that as j increases from 1 to n, the eigen¬ 
vectors \K]j exhibit more sign changes between their 
mutationally connected domains, which can be consid¬ 
ered the ‘frequency’ of the eigenvector, and contribute 
to ruggedness in the fitness landscape. Greater rugged¬ 
ness of the landscape (weight on the Oj with larger j) 


corresponds to a smaller lower bound for the population 
mutational robustness, an effect that can appear even 
before there are multiple fitness peaks. 

Because Theorem [T2| applies to arbitrary fitnesses, it 
includes the mutational landscape modifiers described 
in |3S|, early envisioned in |7]. Note that the range 
of mutational robustness values possible from different 
arrangements of fitness values depends on the spread of 
the eigenvalues A 2 ,..., A„. 

Prior applications of Fourier expansions of landscapes 
(e.g. jSS], |13|) do not actually examine selection- 
mutation dynamics. Rather, only mutation occurs, and 
fitnesses do not enter into reproduction but are simply 
recorded in each generation to produce a time series. 
The main object of analysis has been the autocorrela¬ 
tion function of this time series when it is stationary, de¬ 
rived from the covariance between parent and offspring 
fitnesses. The next result ties the mutation-only covari¬ 
ance to the mutation-selection dynamics. 

Theorem 13 (Relation to Random Mutational Walks). 
For a stationary mutational random walk with reversible 
Markov chain transition matrix M, and stationary dis¬ 
tribution TT, let Covt^(W'p,Wo) be the xr-weighted co- 
variance between parent and offspring fitnesses. Then 
at a mutation-selection balance, wz = MDz, 

1 -k ^ 2 C0V,r [Wv ,Wo] . 

(w' tt) 

The expression states that the mean fitness of the 
population is increased above the 7r-weighted average 
fitness of the genotype space by the covariance between 
the normalized parent and offspring fitnesses in the mu¬ 
tational random walk. 

Corollary 14. When the eigenvalues o/M are positive, 
then Cov7r[VFp, Ho] > 0. 

3.2 Bounds Derived from the Muta¬ 
tional Eigenvalues 

Now we consider bounds on r(MD) that derive from the 
eigenvalues of M for any fitness landscape, to conclude 
the main results. 

Theorem 15 (Upper Bound from the Spectral Gap). 
Assuming 0 < /i < 1/2, then 

71 

WiTTj < r(MD) 

i=l 

WiTTi -k A2(M) max[7(;i] — 

i=l V 

where A 2 (M) = 1—^-k/r max A 2 (P^^^). 




KMD)> ^7 


'3 '^3 


V 7 = 1 






Fundamental Properties of the Evolution of Mutational Robustness— Lee Altenberg 


7 


Theorem when applied to a neutral network at 
biological mutation rates (which give Ai > 0 for all i) 
yields: 

Corollary 16 (Spectral Gap Bound for Neutral Net¬ 
works) . Let the neutral network be referred to as Af = 
{i : Wi = w > 0}. Then 

^ TTi < = Ro(M, D) < ^ TTj -t- A 2 ^ TTi. 

iGTV 

( 10 ) 

Here we see the explanation for the different muta¬ 
tional loads in Fig. [l] For the trinucleotide network, 
A 2 = 5/9, while for the copy-number variation network, 
A 2 = 0.999. Eq. ( [To| ) gives lower bounds on load as 
(1 — A2)/64. Comparison of the bounds with the actual 
loads give 0.007 < 0.0138,0.00002 < 0.0003, respec¬ 
tively, for the two networks. 

Next we shall see that the convexity of r(M(/i)D) in 
/r seen for single-event mutation in Result extends to 
multiple-event reversible mutation. 

Theorem 17 (Convexity of the Spectral Radius in /i). 
In the case where L loci mutate independently at rate /i, 
each in a reversible Markov chain, then for 0 < p < 1/2, 


r(M(/r)D) = 


max 

x'r X=1 




€=i 




is convex in p. 

Theorem has an important implication: “error 
catastrophes” for individual genotypes as the mutation 
rate increases never correspond to a precipitous decline 
in population mean fitness. The convexity of r(M(/i)D) 
in p means that no such declines are ever possible, that 
in contrast, r(M(/i)D) steadily declines at rates that 
never increase as p increases. These observation, noted 
by [351 mi m] based on specific fitness landscapes, are 
seen to hold for all possible fitness landscapes under 
reversible mutation. 


3.3 House-of-Cards Mutation 

The “house-of-cards” (HoC) mutation model intro¬ 
duced by [23 is a single-event mutation model where 
Pij = Ti for all j, which makes P = a rank-one 
matrix, and therefore Aj = 0 for i > 2. 

Result 18 (house-of-cards Mutation). Let'M.{p) = (1— 
p) -|- pP, where P = ttg^. Then 

n 

r(M(/r)D) < (1—^)max[t(;i] + pf 


Under biological conditions the term X]”=i is 
near zero, since it reflects the average fitness of geno¬ 
types under the stationary distribution of mutation — 
i.e. a genome made out of random nucleotides. In this 
case, Ro(M(/r)D) « 1 — /r, which means that house- 
of-cards mutation is incapable of supporting any mu¬ 
tational robustness. Note that HoC mutation at in¬ 
dividual loci does not create HoC mutation for the 
entire genome. But also note that HoC mutation at 
each locus combined with multiplicative non-epistasis 
(D = decomposes into multiple HoC sys¬ 

tems. Therefore, with multi-locus HoC mutation, multi¬ 
plicative epistasis is required for population mutational 
robustness to rise above the Haldane limit. 


4 Discussion 

Priority here has been given to presenting the new 
mathematical results on these decades old problems 
rather than to their biological interpretations or to par¬ 
ticular illustrations. But a few words are in order. The 
chief contribution is to show how the elevation of a 
population’s mean fitness above the Haldane limit at a 
mutation-selection depends on underlying spectral rela¬ 
tionships between the mutation structure and the array 
of fitnesses. The results are obtained in almost com¬ 
plete generality, encompassing arbitrary fitnesses and 
for some results, arbitrary mutation structures, and in 
others, reversible multilocus mutation. The results di¬ 
rectly extend to a novel concept of “dispersal robust¬ 
ness” . 

Infinite population theory involving the spectral ra¬ 
dius r(MD) and Perron vector z > 0 was originally 
developed in one or two locus theory, or dispersal mod¬ 
els, where the population size could easily exceed the 
number of genotypes or patches, and large populations 
were well approximated by infinite population models. 
In quasispecies theory, however, results were aimed to 
encompass the entire genome, entailing genotype spaces 
orders of magnitude larger than any possible popula¬ 
tion, rendering questionable the biological relevance of 
z and of r'(MD). 

Their relevance may possibly be retained in several 
circumstances: (1) when a small number of loci con¬ 
tribute to a multiplicative fitness component, (2) when 
transient dynamics produce a meta-stable mutation- 
selection balance [15], (3) when the dynamics can be 
approximated by a coarse-graining to give a “pheno¬ 
typic quasispecies” m. and (3) in a finite population 
multitype branching process. In the latter, noted in 
[35j . the expected number of offspring i produced from 
one parent j defines a matrix MD, and (MD)‘z(0) is 
the trajectory of expected numbers from an initial pop- 
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ulation, z(0). The long-term expected growth rate of 
the population is then r(MD). 

Each of these conditions merits further investigation. 
With these considerations as caveats, let us review the 
results here. 

The first main result (Theorem [T5| is that the spec¬ 
tral gap of the mutation matrix sets an upper bound 
on how much above than the Haldane limit the equi¬ 
librium population fitness can be. The spectral gap 
measures how rapidly mixing the mutation operator is. 
Theorem Theorem:Main:Eigenvalue thus shows that the 
more rapidly mixing mutation is, the small the muta¬ 
tional robustness at a mutation-selection balance. In 
Kingman’s well-known house-of-cards model, A 2 = 0 so 
the spectral gap is nearly maximal, so almost no muta¬ 
tional robustness is possible. Theorem provides an 
impetus to compare different kinds of mutation models, 
as well as dispersal models, for their spectral gaps and 
therefore how mutational robustness is limited. 

The second main result (Theorem 12) is that the 
alignment of the fitnesses with the orthogonalized eigen¬ 
vectors of the mutation matrix, weighted by the eigen¬ 
values, sets a lower bound on the population mutational 
robustness. This bound is low when the landscape is 
rugged and fitnesses align with the higher-frequency 
eigenvectors, and high when the landscape is smooth 
and fitnesses align with the low-frequency eigenvectors. 

Several additional results generalize results that have 
been found for special cases of fitness landscapes to ar¬ 
bitrary landscapes: Theoremsandshow that the ag¬ 
gregate population mutational robustness can only de¬ 
cline with increasing mutation rate. Theorem [T7] shows 
that this decrease happens at an at ever lessening rate 
as the mutation rate increases. So “error catastrophes” 
for genotypes can never cause the population fitness to 
plummet |44j . But high enough mutation rates will in¬ 
escapably drive the population to the fitness of a ran¬ 
dom genotype sequence, so for some rate less than this, 
lethal mutagenesis is assured. The issue of whether 
lethal mutagenesis can be evaded by the evolution of 
mutational robustness [45] is thus resolved: with high 
enough mutation rates, it cannot. 

These applications are to be seen as only initial il¬ 
lustrations of the potential usefulness of the analysis 
presented here. Other population dynamical processes 
may similarly be clarified by viewing them through this 
lens of their eigenvalues an eigenvectors. 
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Supporting Information 


Proofs of the Results 

For convenience, the results are restated along with 
their proofs. 

Result 2 (Haldane Limits on Mutational Robustness). 


Proof. The average mutant offspring offspring fitness 
from parent j, averaged over the population, cen- 
sused after selection, before reproduction, is 


w 


-(M(M),D) = ^rhr^=E 

i=i 




n v^n 


= E 


Ei=i 


i=i 


1-M, 


n 


w 


Ro(M(/r)D) := > (i _ 

max^WiJ 

Proof. At equilibrium, we examine Zi where Wi = 
max'Lj^zCi. Then, 

n 

wzi = MiiWiZi + MijWjZj 
i =2 

which implies 

n 

{w — Miiwi)zi = MijWjZj > 0 

i =2 

hence [HJ p. 62] 

Ro(M(/r),D) = —>Mn, 

Wi 

with equality if and only if for all j = 2 ,..., n either 
Mij = 0,Wj = 0, or Zj = 0. Further, Mii(/r) = 
so with equality 

if and only if = 0 for all loci □ 

See also [Ml eq. (2.3)], |29l PP- 149-150, eq. (5.31)- 
(5.34)]. 

Theorem 4. Letting the average fitness of mutant off¬ 
spring produced by parent j be 

the average mutant offspring fitness in a population at 
mutation-selection balance is therefore 


Recall that the multiple-event mutation rates are 


L L 

fiO 


{=1 


] 


Assuming Pj^j^ = 0 for all f, j, then Mjj = (1 — p,)^. 
Substitution gives 


^(M(/r),D) = ^ 




2 = 1 






l-(l-/r)^ ffj 


n -,,2i 


E WjZj ^ 1 ^ 3 /-, \ 

- m) 

... w . w 

1,3 — 1 3 — 1 

i-(i-M)^ 


Since 


Ti 2 ^ n 2 ± / 

y^WjZj I \/ 

y A = w > ^ = w 1 + Var[^ 

j=i w w \ W 


^w,z, - w(l + Var[^])(l - p) 
^' w 


hJ™(M(Ai),D) = 


= w — w 


l-(l-M)^ 
( 1 -m)^ 


Varf^] 


= w 


1 - 


-M^Varl^S] 


□ 


w 


-(M(/i),D) :=^ 




w 


Theorem 5 (Neutral Network Robustness). The equi¬ 
librium population robustness V = r(MD)/'u; is always 
greater than the average robustness over the neutral net¬ 
work, 


1 - AbMEvarlS] 




w 


EH = 


Eigw Ejgw ^ij 

Ei=l EjgAt 


assuming the mutation distributions have PiP=0 for all 
loci f and alleles i. 


if 


1. the mutation matrix M is symmetric, 

2. fitness off the network is 0 , and 
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3. not all genotypes on the neutral network have the 
same mutational robustness. 


Proof. Repeated use will be made of the identity 
E”=i fiWi/w for arbitrary values ft. We 
may write the network’s average robustness as 

1 EjgAA _ 1 Ez^i Ej=i 

Er=iE,gArAf,, -w e:=ie;=iM.,tu, 

1 e^DMDe 
w e^De 

We have r(DMD) = wr (MD) since (DMD)‘ = 
w*D(MD)‘ = 'u;‘(DM)*D. 

Since Wi € {ra, 0}, = wD, so hence 


EH = 


1 e^DMOe 1 (eTD)DMD(De) 


w 


De 




(eTD)(De) 


1 xT DMDx , , 

<^max- -==-- (11) 

x^O x' X 

= ^ r(DMD) = = Ro(M,D). 

w 

Equality holds if and only if D is a left eigenvector of 
DMD [ini Theorem 4.2.2 (Rayleigh)], in which case 


Ae^ D = (e^ D)DMD = DMD, 


and so D is also an eigenvector of MD with eigen¬ 
value X/w. This is equivalent to the condition that all 
genotypes on the network have the same neutrality, i.e. 
for all j e Af, c = = EEi or 

ce^D = DMD 

with A = c/w. Therefore, if there are any differences in 
the neutralities on the network, the inequality 0 is 
strict. □ 


Theorem 7 (Corollaries 1, 3 of [33]). The equilibrium 
genetic load of a quasispecies under irreducible multiple- 
event mutation strictly increases with mutation rate 0 < 
/i < 1/2, when the quasispecies has variation in fitness 
within its irreducible blocks. 

Proof. Corollaries 1, 3 of [33] show that 


the irreducible blocks, k, from the Frobenius normal 
form of the reducible matrix M(/r)D. The irreducible 
restriction r([M(/i)D]['‘l) has a unique equilibrium for 
each /r, and Corollaries 1 and 3 apply, meaning that if 
there is any variation in fitness among {wi : i € k}, then 
r([M(^)D]Al) decreases strictly in p,. The quasispecies 
may comprise more than one block, but its mean fitness, 
maxK{r([M(/r)D][''l)}, will therefore also be decreasing 
in p. □ 

Result 9 (Single-Event Relative Robustness Increases 
with Mutation Rates). As the mutation rate increases 
with single-event mutation, if a quasispecies has more 
than one [only one] nonlethal fitness value, then the rel¬ 
ative mutational robustness RRo(M(/r),D) strictly in¬ 
creases [is constant] with mutation rate. 

Proof. We examine the restriction of M(/r)D to an irre¬ 
ducible block M(/r)['‘lD[''l. In the neutral network case 
where Wi = w for all i G a, then 

r(M(^)MDM) ='u;r([(l -/r)l[''l -k/iPl'^')] 

= w[{l -p)+ ^r(pM)] = w[l + Ai(r(pM) - 1)], 

so with single-event mutation, L = 1, and 

RRo(M(rtM,DH) = + M 

P 

_ l-k//(r-(pM) _ 1) _ 1 + ^ 

= r(pM) < 1. 


which shows the relative robustness is a constant for all 

p. 

For brevity write f{p) := r(M(/i.)["’lDl''l)/maxi[u'i] 
and 


9(m) := RRo(M(m)I 


[k] ^ /(/l) - 1 + ^ 


Then 


g{pi)-g{p 2 ) = 


/(mi) - 1 /(M2) - 1 


Ml 


M2 


L 

r{M{p)T)) = r( (g)[(l - + ^iP^JD) (12) 

strictly decreases in p G (0,1/2) when D ^ c I for any 
c > 0, and when there is only one equilibrium for each 
p. In the case where there are lethal genotypes, there 
may be more than one equilibrium, corresponding to 
multiple quasispecies. In that case, one must focus on 


Let Pi = p > 0 and p 2 = hp, 0 < h < 1. 


g{p) - g{hp) 


hf{p) -h - {f{hp) - 1) 


ph 


(13) 


In the case where there are more than one nonlethal 
fitness in the irreducible block, D^''! is nonscalar (DI”! 
c for any c G K). In [19] it is shown, by applying 
“dual convexity” to theorems Cohen [50| and Friedland 
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|51| . that r([(l — jot'll) is strictly convex 

in /L when is nonscalar. Convexity means that for 
all 0 < p < 1, and ai,a 2 , 

(1 -p)/(ai) +Pfio!2) > f{{l-p)ai +pa2). (14) 

Let ai = 0 and 02 = p. We know that 


Applying this representation of M, we find that the 
spectral radius of MD is that of a symmetric matrix: 

r(MD) = r(Dy2KAK^ 

= r(KAK^ D^^/^DDy^) = r-(KAK^ D) 

= r(AKTDK) = r(Di/ 2 K;AK^ D^/y. 


/(O) 


r(DM) 

maxi=i„.._„ 


maxjg^ Wj 


< 1 . 


So [M] becomes 


1-P + pf{p) > (1 - p)/(0) + pf{p) > f{pp)- 


Now, let p = h. Using /(O) < 1 and (13) we get 


0<{l-h)f{0) + hf{p)- f{hp) 

<l-h + hf{p) - f{hp) = ph{gip) - g{hp)). 


Therefore g{p) = RRo(M(/r)[''l, strictly increases 
in 0 < ^ < 1 for any quasispecies k. □ 

Theorem 10 (Expression for the Spectral Radius). 
When mutation has the transition matrix of a reversible 
Markov chain, M = D^/^KAK^DTr^^^, then 

r(MD) = max 

X=1 

71/71 X 2 

= max Xj^JwlKjj ) . (15) 

x^x=l^^ / 

J — i- ^ l — \ ' 

Proof. The transition matrix of an irreducible reversible 
Markov chain can be represented ([331 P- 33] [33]) as 


Note the remarkable symmetry here: r(MD) = 
r(KAK^ D) = r(AK^ DK), which sandwiches one of 
two diagonal matrices, A and D, between the change- 
of-basis matrices K and K^. 

From this symmetric form, the theorem results imme¬ 
diately from application of the Rayleigh quotient char¬ 
acterization of the spectral radius of any symmetric 
real matrix S (|331 pp. 172-173], [371 pp. 176-180]), 
7 ’(S) = maxx 5 ,io(x^ Sx/x^ x). □ 

Remark 1. No assumptions on the irreducibility 
of M or MD enter here. In order for r(MD) to 
be the mean fitness of the population when MD is 
reducible, we must impose the additional assumption 
that the population contain a quasispecies k such that 
r(MMDM) = r(MD). 

Remark 2. The approach taken here may have util¬ 
ity for graph theory. One obtains bounds on the spectral 
radius of a graph by embedding it into a larger graph 
whose eigenvectors and eigenvalues are known, and pro¬ 
jecting those eigenvectors onto the graph. 

Theorem 12. (Lower Bound from the Fourier 
Representation of Fitness) Represent the fitnesses Wi 
as Wi = orw = Ka, where Oi is the Fourier 

coefficient for eigenvector [K]^. Then 


M = Dy^KAK^ 

where reversibility requires MijiTj = MjiTii, and 


r(MD) > 


E 71 

.7=1 




(16) 


1=1 3 


TT = (tti, ... ,7 r„)^ = Mtt is the stationary distribution Proof. The square-root terms y/wi in (15) can be “di¬ 
gested” by employing Xi = Cy/wi, where the constraint 
x^X = 1 implies c = 1/ y/ e^De. Then 


for M, i.e. its Perron vector, 

1 /2 

d;“ is the diagonal matrix of the square roots of tt,, 


K is the matrix of orthogonalized eigenvectors of M, 
KRT = K^K = I, i.e. for each j = l,...,n, 

Er=i Kf, = ELi = 0ifj^h, 

[K], is the j-th column of K, 

Aa = or [K]i = D^'e, 


r(MD) = max > Aj N Xi^/wlKi. 

x=i \ ^—i 




\i=l 


A := diag 


A,; 


is the diagonal matrix of eigenvalues 
Ai>A 2 >--->A„>—lofM, these inequalities 
coming from Perron-Frobenius theory. 


> C Aj ( y^ y/Wiy/WiKij 
1=1 \i=l / 

n / n \ ^ 

= c Aj ( wiKij j 


1=1 

1 


De 


e' DKAK' De. 


(17) 
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Substitution with the Fourier representation w 
De = Ka into (17) gives (16): 


be represented in terms of the Fourier representation of 
fitnesses, w = Ka, as 


r(MD) > e^DKAK^De 

e' De 

= - a^ K^KAK^Ka 

1 1 ^ ^ 

^ ^ ^ FT. X! 

2^j=i Wj Wj 

Several lemmas are needed for the proofs to follow. 
The first is a generalization of the result for graphs in 

m- 


Lemma 13 (Parent-Offspring Fitness Covariance). For 
a stationary mutational random walk with reversible 
Markov chain transition matrix M, and stationary dis¬ 
tribution TT, the TV-weighted covariance between parent 
and offspring fitnesses (W-p, Wq respectively) is 

n / ^ 

Cov^[lFp, Wo] = ^ Aj I ^ WiTvy^Kij 
1=2 \i=l 



1 "■ 

CoVe/„ [Wp, Wo] = - Xitti- 

' n 


2 = 2 


Proof. When M is symmetric, tt = e/n, [K]i = ef^fn, 
and K = ^/ri , and using the Fourier representation 
w = Ka, one gets 


Wj = -w = Ka = a.^/n = ai y/n. 


1=1 

Substituting the above into , one obtains: 
CoVe/„[Wp, Wo] = w^(Dy2KAK^Dy 2 )w-(w^ 7 r) 

= w^(KAK^ )vf/'/n? - (w^e)Vn2 

= a^K^KAK^Ka/n - (a^K^ ef/n^ 

= a^ Aa/n — (a^ jr? 




a? 


□ 


^ 2=1 


2 = 2 


Proof. 


Coy^[Wv,Wo] 

n n / ^ \/^ \ 

= H H - f H ) 

j — l 2=1 ^ j—l ^ ^ij = l ' 

= w^MDtt — (w^7r)(w^M7r) 

= (Dy^KAK^D.,(^/^)D7r — w^TTw^TT (19) 

= DDy^KAK^Dj/^De - DD^ee^DD^e 
= DDy^KAK^ )Dy^De 


= DDy^KAK^ - Dj/^ee^0^2)0^206 
= DDy2(KAK^ - [K]i[Kiy)Dy2De 


= DDy2(K diag 


0 , A 2 , ■ ■ ■ 


KT )Dy 2 De 


= H H Ay 
j=2 Vi=l 


( 20 ) 


□ 


The expression ( 18 ) simplifies when M is symmetric, 
to give the following, originally derived in 


Lemma 14 (Parent-Offspring Fitness Covariance Un¬ 
der Symmetric Mutation [39jb For a stationary muta¬ 
tional random walk with a doubly stochastic transition 
matrix M = IVR = KAK^, and stationary distribution 
TV — e/n, the parent-offspring covariance in fitness can 


Theorem 15 (Relation to Random Mutational Walks). 
For a stationary mutational random walk with reversible 
Markov chain transition matrix M, and stationary dis¬ 
tribution TV, let Cov 7 r(Wp,Wo) be the Tv-weighted co- 
variance between parent and offspring fitnesses. Then 
at a mutation-selection balance, wz, ~ MDz, 


'■(MD) > I ^WjTVj 
^ 1=1 


l-b 


(wTtt)' 


rCov^[Wp,Wo] 


Proof. Using (19), 


r(MD) = max 


xT (D1/2 kAK^ Di/2)x 


> 


e^ (D^D)1/2(D1/2 kAK^ Di/2)(D^D)i/2e 
e^ (D.n-D)e 

e^ DDy^KAK^ D^^De 


1 

W^TT 
.T 


= (w tt) 


[CoVtt (Wp, Wo] + (w TVf] 
1 


1 + 


(w^tt)' 


-Cov^[Wp,Wo] 


□ 


Corollary 16 (Condition for Positive Parent/Offspring 
Fitness Covariance). When the eigenvalues of M are 
positive, then Cov 7 r[Wp, Wo] > 0. 


Proof. When > 0 for all i, then all terms in the sum 
(18) are nonnegative. □ 
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Lemma 17. Since K is orthogonal, its columns form 

a basis to represent x = Kc, or Xi = 

eonstraint x = 1 = Kc = c^c implies c^c = 1 . 

Then 

n n n 

^ ^ ^i^ij — ^ ^ih^h)^ij 

2—1 2 — 1 h — 1 

n n n 

— ^ ^ c/j ^ ^ KijiKij — ^ ^ CfiSfij — Cj. 
h —1 i—l h —1 


Lemma 18. 


max x"^y = (y^y)^/^ min x^y = -(y^y)^/^ ( 21 ) 

X =1 X^ X =1 

Proof. Critical points of max^Tx^i x^y = maxx^o 
satisfy 

_ x^y ^ 

dxi x^X x^X (x^x)2 ’ 

which, for x^x = 1 , are solved by pi = Xifif' y), hence 
y^y = (x^y)^ so a/ y^y = |x^y| is the maximum, and 
—-y/y^y = — |x^y| the minimum. □ 

Theorem 15 (Upper Bound from the Spectral Gap). 
Assuming 0 < /r < 1 / 2 , then 


n 

WiTTi < r(MD) 

i=l 

n / 

< ^ WiTTi + A2(M) max[wi] - 
i=i \ 

where A 2 (M) = 1 —/r+/r^ rnax^A 2 (P^^^). 

Proof. The condition 0 < /i < 1/2 assures that all 
the eigenvalues of M(^)(^) are positive, and thus all 
Ai(M(^)) > 0. A series of inequalities are obtained, the 
last steps using the representation x = Kc from Lemma 
[TtI and Lemma [T 8 | 



r(MD) = 

n 

(xTd>/V/»)^ + J;a,(xTd'/^[K|,)= 


max 

xV x=l 


< max 

xV x=l 


= max 

xT X=1 


i=2 


(xTDi/27ri/2)2 + A2 ^(xTDi/2[K],f 

i =2 

( 1 -A 2 )(x^ Di/27ri/2)2 + 


A2 (xT Di/27ri/2)2 + ^(x^ D1/2[K], 

A J =2 


= max 

X=1 


( 1 -A 2 )(x ^+ ^(x^D 1 / 2 [K]j )2 

i=i 

< (I-A 2 ) max (x ^f + 


x"^ X=1 


A2 inax ^(x^ 

X ' X=1 

n n 

< ( 1 —A2)T^ WiTTi + \2To.ayi[wi] max \^(x^ [K]j)^ 

i x''’x=l 

2=1 J=1 

n n 

= ( 1 —A2) WiTTi + X2 max)^;^] max c| 

- ^ C C—1 . 

2=1 J=1 

n 

= (I-A2) ^ WiTTt + X2 max[uii]. 

i=l 

The lower bound, r(MD) > proven 

in [22 Corollary F.2.]. Here, a separate proof is pro¬ 
vided. 

r(MD) 


= max 

x'^ X=1 


> max 

xT X=1 


> max 

xV x=l 


n n 


(x^ x^y/uTjKjjY 

3=2 i=i 

(x^ D^/^TT^/^)^ Xn'^i'Y^ 

3=2 i=l 




= y^W^TTj. 

i=l 


□ 


Theorem 17 (Convexity of the Spectral Radius in /i). 
In the case where L loci mutate independently at rate pL, 
each in a reversible Markov chain, then for 0 < p, < 1I2, 
r(M(^)D) = 

max x^ 

x"^ X=1 

is convex in fi. 

Proof. For the situation where L loci mutate indepen¬ 
dently at rate /x, and locus ^ has mutation distribution 
matrix the mutation matrix for the entire genome 
is 

L L 

M(ai) = (g) [( = <S> )] ■ 

For the case where mutation at each locus forms a re¬ 
versible Markov chain. 
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Then by the Rayleigh theorem, 
r{M{n)T>) = 


r L 


max 

x=l 


0 

?=i 


( 22 ) 

di/4. 


The total number of genotypes is IG] := n{=i j where 
is the number of alleles at the ^-th locus. As a way 
to index the genotypes for summation we can use lexi¬ 
cographical order, 

if: {1,... ,ni} X •• • X {1,... ,nL} ^ {1,..., |G|}, 

where we use the notation j = j 2 , • ■ • > Jl) S 

and := 

Now ([22| can be expressed as 


r(M(4D) 

|G| L 

E n<i-»‘+Af) 




(xTDl/2[K],)^ 


44)=n ( 


1 - /i+) = 




nMi-A'f) 


€=i 


d 


, (4 = E-(1-4411 [1-41-Af)]. 


To have 1 - ^(1 - A^f) > 0 for all j^, the upper bound ^ ^ ^ 3 ) 0 ). 


since for irreducible with real eigenvalues. Perron- 

(k) 

Frobenius theory gives 1 > > — 1- 

It takes only one positive term in the sum to make 
d^ 

positive. The only way to get all the terms 

to be zero is for = 1 or = 1 for every distinct pair 
of loci {k, 7 } C {1,..., L}, which requires that index j 
have only one ^ 1. For all the j with at least two 
loci where ^ 1 and yf 1 , the term /3j(/r) is strictly 
convex in /r. For the other j where = 1 except for 
at most one locus, is a straight line. From this 

convexity we know that for any j S {1,...,|G|}, for 
/iG ( 0 , 1 ), 

(1 - h)j3j{fii) + h/3j{^i2) > 4((1 “ + 42 )- 

Now we return to the full expression, 

|G| 

r(M(4D) = max E4(4(x^Di/2[K],f. 

X ' X=1 


i=i 


where [K], is the j-th column of K = <^> 4=1 De¬ 
fine 


For a given x, each term (x^is nonnegative. 


so 


|G| r 

E 


(i-44(Mi) + 4i(M2) 


|G| 


(x^ D1/2[K],)2 


First we will see that each 4(/i) is convex in /r. The 
case L = 1 is subsumed in Result We assume L>2. 
The first derivative is 


The second derivative is 
d" 

= E -(1 - A‘:>) E -d - ^1:’) n - ^®)i 

K — 1 

n p-xd - n. 

K. — 1 

(23) 

where for L = 2 there is no term 7 [ 1 ~m (1 ~ A^^]- 
All of the terms (1 — a(” 4 (1 ~ nonnegative. 


= (1-/i)E4(/^i)(4d1/2[K],)2 
i=i 
|G| 

-f/iE4(A*2)(xTDi/2[K],)2 
i=i 
|G| 


^E 

1=1 


/3j((l - h)fii -I- V 2 ) 


(xTd 1 / 2 [K],)^ 


Hence, for any /ri,/i 2 G (0,1/2), and h G (0,1), 

(1 - h) r(M(/ri)D) + /r r(M(/r 2 )D) 

|G| 

= (I-/ 1 ) max E4 (Mi)( 4 
i=i 
|G| 

-I- h max E 4 (M 2 )(x^ 

X*^X =1 ' ^ 


i=i 


|G| 


> max E 4((1 “ + 42 )(x^ 

x*’'x=l 

i=i 


on IS 


1 


fj, := mm ■ 


1 — 1 — min,-., A 


Therefore, r(M(^)D) is convex in fj,. With additional 
> 1/2, (24) care, the conditions for strict convexity can be elicited. 

''Ji □ 
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Result 18 (House of Cards Mutation). Let M(/i) = 
(1 — fj,)I + fLP, where P = 7re^ . Then 

n 

7’(M(/r)D) < (1—/r) max[r(;i] + fJ,y T^iWi. 

Proof. 

r(M(/r)D) = r([(l — ^)D + D) 

= r([(l - /r)D + 

= max [(l-/i)D + Dy^Di/^jx 

v"’" V=;1 


robustness. Let genotype 1 (111) be favored by muta¬ 
tion bias, where the transition matrix of the reversible 
Markov chain is 


0 

3/4 

3/4 

0 

3/4 

0 

0 

0 

1/3 

0 

0 

1/3 

0 

1/3 

0 

0 

1/3 

0 

0 

1/3 

0 

0 

1/3 

0 

0 

1/8 

1/8 

0 

0 

0 

0 

1/3 

1/3 

0 

0 

0 

0 

1/3 

1/3 

0 

0 

1/8 

0 

0 

1/8 

0 

0 

1/3 

0 

0 

1/8 

0 

1/8 

0 

0 

1/3 

0 

0 

0 

1/3 

0 

1/3 

1/3 

0 


= max 

xV x=l 


= max 

xV x=l 


(l-M)^iT.a.? + A^(x^Di/2Dy2e) 
2=1 

n / ^ ^ 

(1-^) ^ Wixl + M I X] 


2=1 


\i=l 


< max fi max ( Xj^'KiWj 

X X XX 

n 

= (1-^) max[wi] + M TTiWi, 
i • ^ 


^2=1 


Genotypes 2,3,5 (101,110,011) are its neighbors. 
Genotypes 2,3,4 (101,110,100) will be off the neutral 
network, so the htnesses are 

e^D = (l,0,0,0,1,1,1,1). 


Then the equilibrium population neutrality is less 
than the average neutrality of the neutral network: 

0 ^ DlVIDe 

r(MD) = 0.6517 < —-= 0.6667. 

^ ^ e^De 


the last term coming from Lemma [TSl 


□ 


Counterexamples to Increased 
Mutational Robustness 

For neutral networks with non-symmetric mutation ma¬ 
trices, it is no longer true that population must con¬ 
centrate on genotypes with above average mutational 
robustness. Two counterexamples are provided. 

Counterexample 1: Cyclic Mutation 

Consider a space of three genotypes, two of which form 
a neutral network, with cyclic mutation 


'0 

1 

o' 


'1 

0 

o' 

0 

0 

1 

,D = 

0 

1 

0 

1 

0 

0 


0 

0 

0 


The neutral network genotypes have average offspring 
fitness e^DMDe/ De = 1 — p,/2 > r(MD) = 1 — /i. 

Counterexample 2: Biased Reversible 
Mutation 


Consider a mutation matrix of three loci with two- 
alleles, under single-event mutation. Suppose that mu¬ 
tation is biased towards a genotype with low mutational 
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